A formulation of the autoregressive HMM for speech synthesis
نویسندگان
چکیده
We present a formulation of the autoregressive HMM for speech synthesis and compare it to the standard HMM synthesis framework and the trajectory HMM. We give details of how to do efficient parameter estimation and synthesis with the autoregressive HMM and discuss consequences of the autoregressive HMM model. There are substantial similarities between the three models, which we explore. The advantages of the autoregressive HMM are that it uses the same model for parameter estimation and synthesis in a consistent way, in contrast to the standard HMM synthesis framework, and that it supports easy and efficient parameter estimation, in contrast to the trajectory HMM.
منابع مشابه
Autoregressive HMMs for speech synthesis
We propose the autoregressive HMM for speech synthesis. We show that the autoregressive HMM supports efficient EM parameter estimation and that we can use established effective synthesis techniques such as synthesis considering global variance with minimal modification. The autoregressive HMM uses the same model for parameter estimation and synthesis in a consistent way, in contrast to the stan...
متن کاملAutoregressive clustering for HMM speech synthesis
The autoregressive HMM has been shown to provide efficient parameter estimation and high-quality synthesis, but in previous experiments decision trees derived from a non-autoregressive system were used. In this paper we investigate the use of autoregressive clustering for autoregressive HMM-based speech synthesis. We describe decision tree clustering for the autoregressive HMM and highlight dif...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملThe Effect of Using Normalized Models in Statistical Speech Synthesis
The standard approach to HMM-based speech synthesis is inconsistent in the enforcement of the deterministic constraints between static and dynamic features. The trajectory HMM and autoregressive HMM have been proposed as normalized models which rectify this inconsistency. This paper investigates the practical effects of using these normalized models, and examines the strengths and weaknesses of...
متن کاملSeparation of Voiced Source Charac Transfer Function Characteristics Fo Analysis Based on Ar-h
A new method was developed for the separation of source and transfer function characteristics of speech sounds, with an aim of utilizing it to “flexible” speech synthesis. The method is based on representing source waveform by an HMM, and transfer function by the AR process (AR-HMM model). As compared to methods based on ARX model, where a parametric representation is assumed for source wavefor...
متن کامل